Dataset statistics
| Number of variables | 18 |
|---|---|
| Number of observations | 2093 |
| Missing cells | 0 |
| Missing cells (%) | 0.0% |
| Duplicate rows | 0 |
| Duplicate rows (%) | 0.0% |
| Total size in memory | 294.5 KiB |
| Average record size in memory | 144.1 B |
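As a quick sanity check, the average record size follows from the total memory footprint divided by the number of observations. The sketch below reuses only the figures reported in the table above; the variable names are illustrative and not part of the report.

```python
# Figures copied from the "Dataset statistics" table.
KIB = 1024            # bytes per KiB
total_kib = 294.5     # total size in memory
n_rows = 2093         # number of observations
n_columns = 18        # number of variables

total_bytes = total_kib * KIB
avg_record_bytes = total_bytes / n_rows
avg_cell_bytes = avg_record_bytes / n_columns

print(f"{avg_record_bytes:.1f} B per record")
print(f"{avg_cell_bytes:.1f} B per cell")
```

Roughly 8 bytes per cell is consistent with every column being held as a 64-bit value, although the report itself does not state the underlying dtypes.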
Variable types
| NUM | 14 |
|---|---|
| CAT | 3 |
| BOOL | 1 |
Reproduction
| Analysis started | 2020-12-07 12:58:41.011567 |
|---|---|
| Analysis finished | 2020-12-07 12:59:32.136854 |
| Duration | 51.13 seconds |
| Version | pandas-profiling v2.8.0 |
| Command line | pandas_profiling --config_file config.yaml [YOUR_FILE.csv] |
| Download configuration | config.yaml |
Warnings
| DisbursementDate is highly correlated with ApprovalFY | High correlation |
|---|---|
| ApprovalFY is highly correlated with DisbursementDate | High correlation |
| GrAppv is highly correlated with DisbursementGross and 1 other fields | High correlation |
| DisbursementGross is highly correlated with GrAppv and 1 other fields | High correlation |
| SBA_Appv is highly correlated with DisbursementGross and 1 other fields | High correlation |
| CreateJob has 1226 (58.6%) zeros | Zeros |
| RetainedJob has 515 (24.6%) zeros | Zeros |
| FranchiseCode has 577 (27.6%) zeros | Zeros |
| ChgOffPrinGr has 1398 (66.8%) zeros | Zeros |
Zip
Real number (ℝ≥0)
| Distinct count | 810 |
|---|---|
| Unique (%) | 38.7% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 92698.30673674152 |
|---|---|
| Minimum | 65757 |
| Maximum | 96161 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 16.5 KiB |
Quantile statistics
| Minimum | 65757 |
|---|---|
| 5-th percentile | 90041.6 |
| Q1 | 91402 |
| median | 92557 |
| Q3 | 94124 |
| 95-th percentile | 95682 |
| Maximum | 96161 |
| Range | 30404 |
| Interquartile range (IQR) | 2722 |
Descriptive statistics
| Standard deviation | 1876.575796 |
|---|---|
| Coefficient of variation (CV) | 0.02024390587 |
| Kurtosis | 20.15777062 |
| Mean | 92698.30674 |
| Median Absolute Deviation (MAD) | 1246 |
| Skewness | -1.415556652 |
| Sum | 194017556 |
| Variance | 3521536.718 |
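Several of the Zip statistics above are derived from one another, so they can be cross-checked with a few lines of arithmetic. This is a minimal sketch using only the reported values; the variable names are illustrative.

```python
# Values copied from the Zip tables above.
n = 2093                       # number of observations
total = 194017556              # Sum
minimum, maximum = 65757, 96161
q1, q3 = 91402, 94124
std = 1876.575796              # Standard deviation

mean = total / n               # Mean = Sum / n
value_range = maximum - minimum
iqr = q3 - q1                  # Interquartile range = Q3 - Q1
cv = std / mean                # Coefficient of variation = std / mean
variance = std ** 2            # Variance = std squared

print(f"mean={mean:.5f} range={value_range} iqr={iqr}")
print(f"cv={cv:.11f} variance={variance:.3f}")
```

Note that Zip is a postal code, so these figures describe the numeric encoding rather than a meaningful quantity.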
| Value | Count | Frequency (%) | |
| 91910 | 14 | 0.7% | |
| 92101 | 14 | 0.7% | |
| 92618 | 14 | 0.7% | |
| 90010 | 13 | 0.6% | |
| 92562 | 11 | 0.5% | |
| 92701 | 11 | 0.5% | |
| 91730 | 11 | 0.5% | |
| 93401 | 10 | 0.5% | |
| 92109 | 10 | 0.5% | |
| 91364 | 10 | 0.5% | |
| 92108 | 10 | 0.5% | |
| 90045 | 10 | 0.5% | |
| 92069 | 10 | 0.5% | |
| 92660 | 9 | 0.4% | |
| 91352 | 9 | 0.4% | |
| 90066 | 9 | 0.4% | |
| 91941 | 9 | 0.4% | |
| 94070 | 9 | 0.4% | |
| 92121 | 9 | 0.4% | |
| 92037 | 9 | 0.4% | |
| 92807 | 9 | 0.4% | |
| 91405 | 8 | 0.4% | |
| 95131 | 8 | 0.4% | |
| 90004 | 8 | 0.4% | |
| 92844 | 8 | 0.4% | |
| Other values (785) | 1841 | 88.0% |
| Value | Count | Frequency (%) | |
| 65757 | 1 | < 0.1% | |
| 81301 | 1 | < 0.1% | |
| 82037 | 1 | < 0.1% | |
| 85008 | 1 | < 0.1% | |
| 90001 | 1 | < 0.1% | |
| 90003 | 1 | < 0.1% | |
| 90004 | 8 | 0.4% | |
| 90005 | 5 | 0.2% | |
| 90006 | 7 | 0.3% | |
| 90007 | 5 | 0.2% |
| Value | Count | Frequency (%) | |
| 96161 | 7 | 0.3% | |
| 96145 | 2 | 0.1% | |
| 96130 | 1 | < 0.1% | |
| 96120 | 1 | < 0.1% | |
| 96093 | 3 | 0.1% | |
| 96080 | 2 | 0.1% | |
| 96022 | 1 | < 0.1% | |
| 96013 | 1 | < 0.1% | |
| 96002 | 2 | 0.1% | |
| 96001 | 3 | 0.1% |
ICS
Real number (ℝ≥0)
| Distinct count | 24 |
|---|---|
| Unique (%) | 1.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 531630.7582417582 |
|---|---|
| Minimum | 531110 |
| Maximum | 533110 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 16.5 KiB |
Quantile statistics
| Minimum | 531110 |
|---|---|
| 5-th percentile | 531190 |
| Q1 | 531210 |
| median | 531312 |
| Q3 | 532230 |
| 95-th percentile | 532490 |
| Maximum | 533110 |
| Range | 2000 |
| Interquartile range (IQR) | 1020 |
Descriptive statistics
| Standard deviation | 522.086575 |
|---|---|
| Coefficient of variation (CV) | 0.0009820473457 |
| Kurtosis | -1.182861806 |
| Mean | 531630.7582 |
| Median Absolute Deviation (MAD) | 102 |
| Skewness | 0.6776118544 |
| Sum | 1112703177 |
| Variance | 272574.3918 |
| Value | Count | Frequency (%) | |
| 531210 | 793 | 37.9% | |
| 532230 | 233 | 11.1% | |
| 531390 | 171 | 8.2% | |
| 531311 | 121 | 5.8% | |
| 532490 | 116 | 5.5% | |
| 531320 | 86 | 4.1% | |
| 532111 | 72 | 3.4% | |
| 532299 | 71 | 3.4% | |
| 531120 | 62 | 3.0% | |
| 531312 | 48 | 2.3% | |
| 532412 | 46 | 2.2% | |
| 532120 | 43 | 2.1% | |
| 532420 | 31 | 1.5% | |
| 532292 | 31 | 1.5% | |
| 532310 | 28 | 1.3% | |
| 531190 | 25 | 1.2% | |
| 532210 | 25 | 1.2% | |
| 531110 | 23 | 1.1% | |
| 533110 | 17 | 0.8% | |
| 532220 | 14 | 0.7% | |
| 532291 | 14 | 0.7% | |
| 532112 | 11 | 0.5% | |
| 532411 | 9 | 0.4% | |
| 531130 | 3 | 0.1% |
| Value | Count | Frequency (%) | |
| 531110 | 23 | 1.1% | |
| 531120 | 62 | 3.0% | |
| 531130 | 3 | 0.1% | |
| 531190 | 25 | 1.2% | |
| 531210 | 793 | 37.9% | |
| 531311 | 121 | 5.8% | |
| 531312 | 48 | 2.3% | |
| 531320 | 86 | 4.1% | |
| 531390 | 171 | 8.2% | |
| 532111 | 72 | 3.4% |
| Value | Count | Frequency (%) | |
| 533110 | 17 | 0.8% | |
| 532490 | 116 | 5.5% | |
| 532420 | 31 | 1.5% | |
| 532412 | 46 | 2.2% | |
| 532411 | 9 | 0.4% | |
| 532310 | 28 | 1.3% | |
| 532299 | 71 | 3.4% | |
| 532292 | 31 | 1.5% | |
| 532291 | 14 | 0.7% | |
| 532230 | 233 | 11.1% |
ApprovalFY
Real number (ℝ≥0)
| Distinct count | 23 |
|---|---|
| Unique (%) | 1.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 2004.032967032967 |
|---|---|
| Minimum | 1989 |
| Maximum | 2011 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 16.5 KiB |
Quantile statistics
| Minimum | 1989 |
|---|---|
| 5-th percentile | 1995 |
| Q1 | 2003 |
| median | 2005 |
| Q3 | 2007 |
| 95-th percentile | 2008 |
| Maximum | 2011 |
| Range | 22 |
| Interquartile range (IQR) | 4 |
Descriptive statistics
| Standard deviation | 3.990591545 |
|---|---|
| Coefficient of variation (CV) | 0.001991280388 |
| Kurtosis | 2.948616214 |
| Mean | 2004.032967 |
| Median Absolute Deviation (MAD) | 2 |
| Skewness | -1.62594867 |
| Sum | 4194441 |
| Variance | 15.92482088 |
| Value | Count | Frequency (%) | |
| 2007 | 404 | 19.3% | |
| 2006 | 338 | 16.1% | |
| 2005 | 246 | 11.8% | |
| 2004 | 227 | 10.8% | |
| 2003 | 183 | 8.7% | |
| 2002 | 133 | 6.4% | |
| 2008 | 123 | 5.9% | |
| 2001 | 98 | 4.7% | |
| 2000 | 42 | 2.0% | |
| 1999 | 41 | 2.0% | |
| 2010 | 33 | 1.6% | |
| 2009 | 28 | 1.3% | |
| 1991 | 27 | 1.3% | |
| 1998 | 26 | 1.2% | |
| 1995 | 24 | 1.1% | |
| 1997 | 23 | 1.1% | |
| 1996 | 17 | 0.8% | |
| 1990 | 15 | 0.7% | |
| 1994 | 14 | 0.7% | |
| 1993 | 14 | 0.7% | |
| 1989 | 13 | 0.6% | |
| 1992 | 12 | 0.6% | |
| 2011 | 12 | 0.6% |
| Value | Count | Frequency (%) | |
| 1989 | 13 | 0.6% | |
| 1990 | 15 | 0.7% | |
| 1991 | 27 | 1.3% | |
| 1992 | 12 | 0.6% | |
| 1993 | 14 | 0.7% | |
| 1994 | 14 | 0.7% | |
| 1995 | 24 | 1.1% | |
| 1996 | 17 | 0.8% | |
| 1997 | 23 | 1.1% | |
| 1998 | 26 | 1.2% |
| Value | Count | Frequency (%) | |
| 2011 | 12 | 0.6% | |
| 2010 | 33 | 1.6% | |
| 2009 | 28 | 1.3% | |
| 2008 | 123 | 5.9% | |
| 2007 | 404 | 19.3% | |
| 2006 | 338 | 16.1% | |
| 2005 | 246 | 11.8% | |
| 2004 | 227 | 10.8% | |
| 2003 | 183 | 8.7% | |
| 2002 | 133 | 6.4% |
Term
Real number (ℝ≥0)
| Distinct count | 170 |
|---|---|
| Unique (%) | 8.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 127.0831342570473 |
|---|---|
| Minimum | 0 |
| Maximum | 306 |
| Zeros | 3 |
| Zeros (%) | 0.1% |
| Memory size | 16.5 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 25 |
| Q1 | 60 |
| median | 84 |
| Q3 | 240 |
| 95-th percentile | 300 |
| Maximum | 306 |
| Range | 306 |
| Interquartile range (IQR) | 180 |
Descriptive statistics
| Standard deviation | 93.85850772 |
|---|---|
| Coefficient of variation (CV) | 0.7385599062 |
| Kurtosis | -0.8737228175 |
| Mean | 127.0831343 |
| Median Absolute Deviation (MAD) | 36 |
| Skewness | 0.824460891 |
| Sum | 265985 |
| Variance | 8809.419472 |
| Value | Count | Frequency (%) | |
| 84 | 474 | 22.6% | |
| 240 | 265 | 12.7% | |
| 300 | 253 | 12.1% | |
| 120 | 156 | 7.5% | |
| 60 | 78 | 3.7% | |
| 36 | 42 | 2.0% | |
| 59 | 24 | 1.1% | |
| 55 | 21 | 1.0% | |
| 57 | 19 | 0.9% | |
| 63 | 18 | 0.9% | |
| 58 | 17 | 0.8% | |
| 51 | 17 | 0.8% | |
| 61 | 16 | 0.8% | |
| 72 | 16 | 0.8% | |
| 54 | 16 | 0.8% | |
| 64 | 15 | 0.7% | |
| 48 | 14 | 0.7% | |
| 12 | 14 | 0.7% | |
| 62 | 13 | 0.6% | |
| 68 | 13 | 0.6% | |
| 56 | 13 | 0.6% | |
| 42 | 13 | 0.6% | |
| 70 | 13 | 0.6% | |
| 53 | 12 | 0.6% | |
| 96 | 12 | 0.6% | |
| Other values (145) | 529 | 25.3% |
| Value | Count | Frequency (%) | |
| 0 | 3 | 0.1% | |
| 1 | 6 | 0.3% | |
| 2 | 2 | 0.1% | |
| 3 | 2 | 0.1% | |
| 4 | 4 | 0.2% | |
| 5 | 3 | 0.1% | |
| 6 | 1 | < 0.1% | |
| 7 | 1 | < 0.1% | |
| 8 | 7 | 0.3% | |
| 9 | 5 | 0.2% |
| Value | Count | Frequency (%) | |
| 306 | 3 | 0.1% | |
| 305 | 1 | < 0.1% | |
| 304 | 1 | < 0.1% | |
| 303 | 2 | 0.1% | |
| 301 | 1 | < 0.1% | |
| 300 | 253 | 12.1% | |
| 297 | 1 | < 0.1% | |
| 294 | 1 | < 0.1% | |
| 291 | 1 | < 0.1% | |
| 290 | 1 | < 0.1% |
NoEmp
Real number (ℝ≥0)
| Distinct count | 83 |
|---|---|
| Unique (%) | 4.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 10.162446249402771 |
|---|---|
| Minimum | 0 |
| Maximum | 650 |
| Zeros | 10 |
| Zeros (%) | 0.5% |
| Memory size | 16.5 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 1 |
| Q1 | 2 |
| median | 3 |
| Q3 | 8 |
| 95-th percentile | 32 |
| Maximum | 650 |
| Range | 650 |
| Interquartile range (IQR) | 6 |
Descriptive statistics
| Standard deviation | 34.47265084 |
|---|---|
| Coefficient of variation (CV) | 3.392160706 |
| Kurtosis | 187.0350313 |
| Mean | 10.16244625 |
| Median Absolute Deviation (MAD) | 2 |
| Skewness | 12.29848122 |
| Sum | 21270 |
| Variance | 1188.363656 |
| Value | Count | Frequency (%) | |
| 1 | 448 | 21.4% | |
| 2 | 381 | 18.2% | |
| 3 | 221 | 10.6% | |
| 4 | 159 | 7.6% | |
| 5 | 142 | 6.8% | |
| 6 | 106 | 5.1% | |
| 10 | 73 | 3.5% | |
| 7 | 66 | 3.2% | |
| 8 | 64 | 3.1% | |
| 12 | 48 | 2.3% | |
| 15 | 40 | 1.9% | |
| 13 | 32 | 1.5% | |
| 9 | 31 | 1.5% | |
| 20 | 26 | 1.2% | |
| 11 | 24 | 1.1% | |
| 16 | 20 | 1.0% | |
| 14 | 17 | 0.8% | |
| 25 | 12 | 0.6% | |
| 30 | 11 | 0.5% | |
| 0 | 10 | 0.5% | |
| 18 | 9 | 0.4% | |
| 40 | 9 | 0.4% | |
| 21 | 8 | 0.4% | |
| 17 | 8 | 0.4% | |
| 50 | 7 | 0.3% | |
| Other values (58) | 121 | 5.8% |
| Value | Count | Frequency (%) | |
| 0 | 10 | 0.5% | |
| 1 | 448 | 21.4% | |
| 2 | 381 | 18.2% | |
| 3 | 221 | 10.6% | |
| 4 | 159 | 7.6% | |
| 5 | 142 | 6.8% | |
| 6 | 106 | 5.1% | |
| 7 | 66 | 3.2% | |
| 8 | 64 | 3.1% | |
| 9 | 31 | 1.5% |
| Value | Count | Frequency (%) | |
| 650 | 1 | < 0.1% | |
| 600 | 2 | 0.1% | |
| 535 | 1 | < 0.1% | |
| 450 | 1 | < 0.1% | |
| 345 | 1 | < 0.1% | |
| 327 | 1 | < 0.1% | |
| 244 | 1 | < 0.1% | |
| 237 | 1 | < 0.1% | |
| 225 | 1 | < 0.1% | |
| 220 | 1 | < 0.1% |
NewExist
Categorical
| Distinct count | 3 |
|---|---|
| Unique (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 16.5 KiB |
| Value | Count | Frequency (%) | |
| 1 | 1770 | 84.6% | |
| 2 | 322 | 15.4% | |
| 0 | 1 | < 0.1% |
Length
| Max length | 3 |
|---|---|
| Median length | 3 |
| Mean length | 3 |
| Min length | 3 |
Most occurring characters
| Value | Count | Frequency (%) | |
| 0 | 2094 | 33.3% | |
| . | 2093 | 33.3% | |
| 1 | 1770 | 28.2% | |
| 2 | 322 | 5.1% |
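The character counts above are what one would expect if NewExist was read as float-formatted strings such as "1.0". That is an assumption inferred from the fixed length of 3 and the presence of ".", not something the report states. Under that assumption, a minimal sketch reproduces the table from the category counts:

```python
from collections import Counter

# Assumption: each category is stored as a 3-character string like "1.0".
# The counts are taken from the NewExist frequency table.
value_counts = {"1.0": 1770, "2.0": 322, "0.0": 1}

char_counts = Counter()
for value, count in value_counts.items():
    for ch in value:
        char_counts[ch] += count  # each row contributes every character once

total_chars = sum(char_counts.values())
for ch, count in char_counts.most_common():
    print(f"{ch!r}: {count} ({count / total_chars:.1%})")
```

Every value contributes one "." and one trailing "0", and the single "0.0" row adds one more "0", which explains why "0" appears 2094 times against 2093 for ".".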
Most occurring categories
| Value | Count | Frequency (%) | |
| Decimal Number | 4186 | 66.7% | |
| Other Punctuation | 2093 | 33.3% |
Most frequent Decimal Number characters
| Value | Count | Frequency (%) | |
| 0 | 2094 | 50.0% | |
| 1 | 1770 | 42.3% | |
| 2 | 322 | 7.7% |
Most frequent Other Punctuation characters
| Value | Count | Frequency (%) | |
| . | 2093 | 100.0% |
Most occurring scripts
| Value | Count | Frequency (%) | |
| Common | 6279 | 100.0% |
Most frequent Common characters
| Value | Count | Frequency (%) | |
| 0 | 2094 | 33.3% | |
| . | 2093 | 33.3% | |
| 1 | 1770 | 28.2% | |
| 2 | 322 | 5.1% |
Most occurring blocks
| Value | Count | Frequency (%) | |
| ASCII | 6279 | 100.0% |
Most frequent ASCII characters
| Value | Count | Frequency (%) | |
| 0 | 2094 | 33.3% | |
| . | 2093 | 33.3% | |
| 1 | 1770 | 28.2% | |
| 2 | 322 | 5.1% |
CreateJob
Real number (ℝ≥0)
| Distinct count | 43 |
|---|---|
| Unique (%) | 2.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 2.5551839464882944 |
|---|---|
| Minimum | 0 |
| Maximum | 130 |
| Zeros | 1226 |
| Zeros (%) | 58.6% |
| Memory size | 16.5 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 0 |
| Q3 | 2 |
| 95-th percentile | 10.4 |
| Maximum | 130 |
| Range | 130 |
| Interquartile range (IQR) | 2 |
Descriptive statistics
| Standard deviation | 8.026333221 |
|---|---|
| Coefficient of variation (CV) | 3.141195855 |
| Kurtosis | 81.58491365 |
| Mean | 2.555183946 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 7.799476199 |
| Sum | 5348 |
| Variance | 64.42202498 |
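The MAD of 0 reported above follows from the share of zeros (58.6%): when more than half of the values equal the median, the median of the absolute deviations is also 0. A minimal sketch on made-up data (not the dataset's values), assuming the plain, unscaled median-based definition:

```python
from statistics import median

def median_absolute_deviation(values):
    """Median of the absolute deviations from the median (unscaled)."""
    center = median(values)
    return median([abs(v - center) for v in values])

# Hypothetical sample in which 6 of 10 values are zero.
mostly_zeros = [0, 0, 0, 0, 0, 0, 1, 2, 5, 20]
# Hypothetical sample without a dominant value.
spread_out = [1, 2, 2, 3, 8]

print(median_absolute_deviation(mostly_zeros))
print(median_absolute_deviation(spread_out))
```

For zero-inflated columns like this one, the standard deviation or the upper percentiles say more about spread than the MAD does.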
| Value | Count | Frequency (%) | |
| 0 | 1226 | 58.6% | |
| 1 | 226 | 10.8% | |
| 2 | 198 | 9.5% | |
| 3 | 88 | 4.2% | |
| 4 | 78 | 3.7% | |
| 5 | 68 | 3.2% | |
| 6 | 31 | 1.5% | |
| 10 | 30 | 1.4% | |
| 8 | 20 | 1.0% | |
| 15 | 15 | 0.7% | |
| 20 | 14 | 0.7% | |
| 7 | 12 | 0.6% | |
| 9 | 11 | 0.5% | |
| 12 | 10 | 0.5% | |
| 25 | 8 | 0.4% | |
| 50 | 7 | 0.3% | |
| 11 | 5 | 0.2% | |
| 18 | 4 | 0.2% | |
| 13 | 4 | 0.2% | |
| 100 | 3 | 0.1% | |
| 75 | 3 | 0.1% | |
| 35 | 3 | 0.1% | |
| 21 | 2 | 0.1% | |
| 14 | 2 | 0.1% | |
| 19 | 2 | 0.1% | |
| Other values (18) | 23 | 1.1% |
| Value | Count | Frequency (%) | |
| 0 | 1226 | 58.6% | |
| 1 | 226 | 10.8% | |
| 2 | 198 | 9.5% | |
| 3 | 88 | 4.2% | |
| 4 | 78 | 3.7% | |
| 5 | 68 | 3.2% | |
| 6 | 31 | 1.5% | |
| 7 | 12 | 0.6% | |
| 8 | 20 | 1.0% | |
| 9 | 11 | 0.5% |
| Value | Count | Frequency (%) | |
| 130 | 1 | < 0.1% | |
| 100 | 3 | 0.1% | |
| 75 | 3 | 0.1% | |
| 69 | 1 | < 0.1% | |
| 65 | 1 | < 0.1% | |
| 63 | 1 | < 0.1% | |
| 60 | 1 | < 0.1% | |
| 50 | 7 | 0.3% | |
| 45 | 1 | < 0.1% | |
| 40 | 2 | 0.1% |
RetainedJob
Real number (ℝ≥0)
| Distinct count | 62 |
|---|---|
| Unique (%) | 3.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 5.814620162446249 |
|---|---|
| Minimum | 0 |
| Maximum | 535 |
| Zeros | 515 |
| Zeros (%) | 24.6% |
| Memory size | 16.5 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 1 |
| median | 2 |
| Q3 | 5 |
| 95-th percentile | 20 |
| Maximum | 535 |
| Range | 535 |
| Interquartile range (IQR) | 4 |
Descriptive statistics
| Standard deviation | 19.01559278 |
|---|---|
| Coefficient of variation (CV) | 3.27030696 |
| Kurtosis | 350.2461633 |
| Mean | 5.814620162 |
| Median Absolute Deviation (MAD) | 2 |
| Skewness | 15.55617554 |
| Sum | 12170 |
| Variance | 361.5927689 |
| Value | Count | Frequency (%) | |
| 0 | 515 | 24.6% | |
| 1 | 362 | 17.3% | |
| 2 | 310 | 14.8% | |
| 3 | 175 | 8.4% | |
| 4 | 131 | 6.3% | |
| 5 | 110 | 5.3% | |
| 6 | 75 | 3.6% | |
| 8 | 56 | 2.7% | |
| 10 | 51 | 2.4% | |
| 7 | 45 | 2.2% | |
| 12 | 38 | 1.8% | |
| 15 | 20 | 1.0% | |
| 20 | 19 | 0.9% | |
| 9 | 18 | 0.9% | |
| 11 | 18 | 0.9% | |
| 13 | 17 | 0.8% | |
| 16 | 13 | 0.6% | |
| 14 | 11 | 0.5% | |
| 30 | 8 | 0.4% | |
| 17 | 8 | 0.4% | |
| 25 | 5 | 0.2% | |
| 21 | 5 | 0.2% | |
| 18 | 5 | 0.2% | |
| 19 | 4 | 0.2% | |
| 22 | 4 | 0.2% | |
| Other values (37) | 70 | 3.3% |
| Value | Count | Frequency (%) | |
| 0 | 515 | 24.6% | |
| 1 | 362 | 17.3% | |
| 2 | 310 | 14.8% | |
| 3 | 175 | 8.4% | |
| 4 | 131 | 6.3% | |
| 5 | 110 | 5.3% | |
| 6 | 75 | 3.6% | |
| 7 | 45 | 2.2% | |
| 8 | 56 | 2.7% | |
| 9 | 18 | 0.9% |
| Value | Count | Frequency (%) | |
| 535 | 1 | < 0.1% | |
| 327 | 1 | < 0.1% | |
| 244 | 1 | < 0.1% | |
| 220 | 1 | < 0.1% | |
| 150 | 2 | 0.1% | |
| 130 | 2 | 0.1% | |
| 102 | 1 | < 0.1% | |
| 100 | 1 | < 0.1% | |
| 88 | 1 | < 0.1% | |
| 85 | 2 | 0.1% |
FranchiseCode
Real number (ℝ≥0)
| Distinct count | 33 |
|---|---|
| Unique (%) | 1.6% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 1970.1920688007644 |
|---|---|
| Minimum | 0 |
| Maximum | 89658 |
| Zeros | 577 |
| Zeros (%) | 27.6% |
| Memory size | 16.5 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 1 |
| Q3 | 1 |
| 95-th percentile | 1 |
| Maximum | 89658 |
| Range | 89658 |
| Interquartile range (IQR) | 1 |
Descriptive statistics
| Standard deviation | 11263.23066 |
|---|---|
| Coefficient of variation (CV) | 5.7168186 |
| Kurtosis | 37.09410803 |
| Mean | 1970.192069 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 6.105100645 |
| Sum | 4123612 |
| Variance | 126860365 |
| Value | Count | Frequency (%) | |
| 1 | 1437 | 68.7% | |
| 0 | 577 | 27.6% | |
| 15710 | 10 | 0.5% | |
| 68905 | 8 | 0.4% | |
| 68060 | 7 | 0.3% | |
| 88407 | 6 | 0.3% | |
| 69149 | 6 | 0.3% | |
| 37481 | 5 | 0.2% | |
| 18000 | 3 | 0.1% | |
| 69560 | 3 | 0.1% | |
| 68880 | 3 | 0.1% | |
| 67219 | 3 | 0.1% | |
| 79897 | 2 | 0.1% | |
| 1357 | 2 | 0.1% | |
| 10533 | 2 | 0.1% | |
| 28236 | 2 | 0.1% | |
| 33747 | 1 | < 0.1% | |
| 49952 | 1 | < 0.1% | |
| 26685 | 1 | < 0.1% | |
| 10465 | 1 | < 0.1% | |
| 41298 | 1 | < 0.1% | |
| 44890 | 1 | < 0.1% | |
| 43571 | 1 | < 0.1% | |
| 78760 | 1 | < 0.1% | |
| 88905 | 1 | < 0.1% | |
| Other values (8) | 8 | 0.4% |
| Value | Count | Frequency (%) | |
| 0 | 577 | 27.6% | |
| 1 | 1437 | 68.7% | |
| 1357 | 2 | 0.1% | |
| 10465 | 1 | < 0.1% | |
| 10533 | 2 | 0.1% | |
| 15100 | 1 | < 0.1% | |
| 15710 | 10 | 0.5% | |
| 18000 | 3 | 0.1% | |
| 18160 | 1 | < 0.1% | |
| 26685 | 1 | < 0.1% |
| Value | Count | Frequency (%) | |
| 89658 | 1 | < 0.1% | |
| 88905 | 1 | < 0.1% | |
| 88902 | 1 | < 0.1% | |
| 88407 | 6 | 0.3% | |
| 87075 | 1 | < 0.1% | |
| 81800 | 1 | < 0.1% | |
| 79897 | 2 | 0.1% | |
| 78760 | 1 | < 0.1% | |
| 69560 | 3 | 0.1% | |
| 69149 | 6 | 0.3% |
UrbanRural
Categorical
| Distinct count | 3 |
|---|---|
| Unique (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 16.5 KiB |
| Value | Count | Frequency (%) | |
| 1 | 1736 | 82.9% | |
| 0 | 230 | 11.0% | |
| 2 | 127 | 6.1% |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Most occurring characters
| Value | Count | Frequency (%) | |
| 1 | 1736 | 82.9% | |
| 0 | 230 | 11.0% | |
| 2 | 127 | 6.1% |
Most occurring categories
| Value | Count | Frequency (%) | |
| Decimal Number | 2093 | 100.0% |
Most frequent Decimal Number characters
| Value | Count | Frequency (%) | |
| 1 | 1736 | 82.9% | |
| 0 | 230 | 11.0% | |
| 2 | 127 | 6.1% |
Most occurring scripts
| Value | Count | Frequency (%) | |
| Common | 2093 | 100.0% |
Most frequent Common characters
| Value | Count | Frequency (%) | |
| 1 | 1736 | 82.9% | |
| 0 | 230 | 11.0% | |
| 2 | 127 | 6.1% |
Most occurring blocks
| Value | Count | Frequency (%) | |
| ASCII | 2093 | 100.0% |
Most frequent ASCII characters
| Value | Count | Frequency (%) | |
| 1 | 1736 | 82.9% | |
| 0 | 230 | 11.0% | |
| 2 | 127 | 6.1% |
RevLineCr
Categorical
| Distinct count | 4 |
|---|---|
| Unique (%) | 0.2% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 16.5 KiB |
| Value | Count | Frequency (%) | |
| 1 | 733 | 35.0% | |
| 0 | 729 | 34.8% | |
| 2 | 578 | 27.6% | |
| 3 | 53 | 2.5% |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Most occurring characters
| Value | Count | Frequency (%) | |
| 1 | 733 | 35.0% | |
| 0 | 729 | 34.8% | |
| 2 | 578 | 27.6% | |
| 3 | 53 | 2.5% |
Most occurring categories
| Value | Count | Frequency (%) | |
| Decimal Number | 2093 | 100.0% |
Most frequent Decimal Number characters
| Value | Count | Frequency (%) | |
| 1 | 733 | 35.0% | |
| 0 | 729 | 34.8% | |
| 2 | 578 | 27.6% | |
| 3 | 53 | 2.5% |
Most occurring scripts
| Value | Count | Frequency (%) | |
| Common | 2093 | 100.0% |
Most frequent Common characters
| Value | Count | Frequency (%) | |
| 1 | 733 | 35.0% | |
| 0 | 729 | 34.8% | |
| 2 | 578 | 27.6% | |
| 3 | 53 | 2.5% |
Most occurring blocks
| Value | Count | Frequency (%) | |
| ASCII | 2093 | 100.0% |
Most frequent ASCII characters
| Value | Count | Frequency (%) | |
| 1 | 733 | 35.0% | |
| 0 | 729 | 34.8% | |
| 2 | 578 | 27.6% | |
| 3 | 53 | 2.5% |
LowDoc
Real number (ℝ≥0)
| Distinct count | 5 |
|---|---|
| Unique (%) | 0.2% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 1.0234113712374582 |
|---|---|
| Minimum | 0 |
| Maximum | 4 |
| Zeros | 1 |
| Zeros (%) | < 0.1% |
| Memory size | 16.5 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 1 |
| Q1 | 1 |
| median | 1 |
| Q3 | 1 |
| 95-th percentile | 1 |
| Maximum | 4 |
| Range | 4 |
| Interquartile range (IQR) | 0 |
Descriptive statistics
| Standard deviation | 0.1719487379 |
|---|---|
| Coefficient of variation (CV) | 0.16801527 |
| Kurtosis | 86.20940044 |
| Mean | 1.023411371 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 8.1530416 |
| Sum | 2142 |
| Variance | 0.02956636846 |
| Value | Count | Frequency (%) | |
| 1 | 2047 | 97.8% | |
| 2 | 41 | 2.0% | |
| 3 | 3 | 0.1% | |
| 4 | 1 | < 0.1% | |
| 0 | 1 | < 0.1% |
| Value | Count | Frequency (%) | |
| 0 | 1 | < 0.1% | |
| 1 | 2047 | 97.8% | |
| 2 | 41 | 2.0% | |
| 3 | 3 | 0.1% | |
| 4 | 1 | < 0.1% |
| Value | Count | Frequency (%) | |
| 4 | 1 | < 0.1% | |
| 3 | 3 | 0.1% | |
| 2 | 41 | 2.0% | |
| 1 | 2047 | 97.8% | |
| 0 | 1 | < 0.1% |
DisbursementDate
Real number (ℝ≥0)
| Distinct count | 22 |
|---|---|
| Unique (%) | 1.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 2004.0363115145724 |
|---|---|
| Minimum | 1989 |
| Maximum | 2010 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 16.5 KiB |
Quantile statistics
| Minimum | 1989 |
|---|---|
| 5-th percentile | 1995 |
| Q1 | 2003 |
| median | 2005 |
| Q3 | 2007 |
| 95-th percentile | 2008 |
| Maximum | 2010 |
| Range | 21 |
| Interquartile range (IQR) | 4 |
Descriptive statistics
| Standard deviation | 3.96008046 |
|---|---|
| Coefficient of variation (CV) | 0.001976052249 |
| Kurtosis | 2.982347253 |
| Mean | 2004.036312 |
| Median Absolute Deviation (MAD) | 2 |
| Skewness | -1.647102511 |
| Sum | 4194448 |
| Variance | 15.68223725 |
| Value | Count | Frequency (%) | |
| 2007 | 405 | 19.4% | |
| 2006 | 387 | 18.5% | |
| 2005 | 222 | 10.6% | |
| 2004 | 218 | 10.4% | |
| 2003 | 189 | 9.0% | |
| 2002 | 139 | 6.6% | |
| 2001 | 88 | 4.2% | |
| 2008 | 86 | 4.1% | |
| 2009 | 49 | 2.3% | |
| 1999 | 45 | 2.2% | |
| 2000 | 40 | 1.9% | |
| 2010 | 40 | 1.9% | |
| 1998 | 28 | 1.3% | |
| 1997 | 23 | 1.1% | |
| 1991 | 21 | 1.0% | |
| 1995 | 21 | 1.0% | |
| 1996 | 19 | 0.9% | |
| 1990 | 17 | 0.8% | |
| 1992 | 16 | 0.8% | |
| 1993 | 15 | 0.7% | |
| 1994 | 13 | 0.6% | |
| 1989 | 12 | 0.6% |
| Value | Count | Frequency (%) | |
| 1989 | 12 | 0.6% | |
| 1990 | 17 | 0.8% | |
| 1991 | 21 | 1.0% | |
| 1992 | 16 | 0.8% | |
| 1993 | 15 | 0.7% | |
| 1994 | 13 | 0.6% | |
| 1995 | 21 | 1.0% | |
| 1996 | 19 | 0.9% | |
| 1997 | 23 | 1.1% | |
| 1998 | 28 | 1.3% |
| Value | Count | Frequency (%) | |
| 2010 | 40 | 1.9% | |
| 2009 | 49 | 2.3% | |
| 2008 | 86 | 4.1% | |
| 2007 | 405 | 19.4% | |
| 2006 | 387 | 18.5% | |
| 2005 | 222 | 10.6% | |
| 2004 | 218 | 10.4% | |
| 2003 | 189 | 9.0% | |
| 2002 | 139 | 6.6% | |
| 2001 | 88 | 4.2% |
DisbursementGross
Real number (ℝ≥0)
| Distinct count | 1180 |
|---|---|
| Unique (%) | 56.4% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 243108.55327281414 |
|---|---|
| Minimum | 4835 |
| Maximum | 2315000 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 16.5 KiB |
Quantile statistics
| Minimum | 4835 |
|---|---|
| 5-th percentile | 10000 |
| Q1 | 40212 |
| median | 100000 |
| Q3 | 300000 |
| 95-th percentile | 1000000 |
| Maximum | 2315000 |
| Range | 2310165 |
| Interquartile range (IQR) | 259788 |
Descriptive statistics
| Standard deviation | 338593.0288 |
|---|---|
| Coefficient of variation (CV) | 1.392764772 |
| Kurtosis | 6.248458527 |
| Mean | 243108.5533 |
| Median Absolute Deviation (MAD) | 75000 |
| Skewness | 2.36560185 |
| Sum | 508826202 |
| Variance | 1.146452392e+11 |
| Value | Count | Frequency (%) | |
| 50000 | 118 | 5.6% | |
| 25000 | 71 | 3.4% | |
| 100000 | 67 | 3.2% | |
| 10000 | 67 | 3.2% | |
| 150000 | 37 | 1.8% | |
| 35000 | 35 | 1.7% | |
| 15000 | 28 | 1.3% | |
| 5000 | 27 | 1.3% | |
| 20000 | 24 | 1.1% | |
| 30000 | 21 | 1.0% | |
| 200000 | 19 | 0.9% | |
| 250000 | 15 | 0.7% | |
| 300000 | 14 | 0.7% | |
| 60000 | 14 | 0.7% | |
| 1000000 | 14 | 0.7% | |
| 75000 | 14 | 0.7% | |
| 500000 | 13 | 0.6% | |
| 45000 | 12 | 0.6% | |
| 65000 | 11 | 0.5% | |
| 80000 | 10 | 0.5% | |
| 40000 | 10 | 0.5% | |
| 55000 | 10 | 0.5% | |
| 105000 | 8 | 0.4% | |
| 135000 | 8 | 0.4% | |
| 70000 | 8 | 0.4% | |
| Other values (1155) | 1418 | 67.7% |
| Value | Count | Frequency (%) | |
| 4835 | 1 | < 0.1% | |
| 4999 | 1 | < 0.1% | |
| 5000 | 27 | 1.3% | |
| 5292 | 1 | < 0.1% | |
| 5600 | 1 | < 0.1% | |
| 6000 | 3 | 0.1% | |
| 6861 | 1 | < 0.1% | |
| 7000 | 1 | < 0.1% | |
| 7319 | 1 | < 0.1% | |
| 7500 | 7 | 0.3% |
| Value | Count | Frequency (%) | |
| 2315000 | 1 | < 0.1% | |
| 2000000 | 6 | 0.3% | |
| 1999000 | 1 | < 0.1% | |
| 1944300 | 1 | < 0.1% | |
| 1899500 | 1 | < 0.1% | |
| 1836000 | 1 | < 0.1% | |
| 1799000 | 1 | < 0.1% | |
| 1682000 | 1 | < 0.1% | |
| 1665000 | 2 | 0.1% | |
| 1656500 | 1 | < 0.1% |
MIS_Status
Boolean
| Distinct count | 2 |
|---|---|
| Unique (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 16.5 KiB |
| Value | Count | Frequency (%) | |
| 0 | 1408 | 67.3% | |
| 1 | 685 | 32.7% |
ChgOffPrinGr
Real number (ℝ≥0)
| Distinct count | 613 |
|---|---|
| Unique (%) | 29.3% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 20110.022933588152 |
|---|---|
| Minimum | 0 |
| Maximum | 1509550 |
| Zeros | 1398 |
| Zeros (%) | 66.8% |
| Memory size | 16.5 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 0 |
| Q3 | 16332 |
| 95-th percentile | 82781.2 |
| Maximum | 1509550 |
| Range | 1509550 |
| Interquartile range (IQR) | 16332 |
Descriptive statistics
| Standard deviation | 75584.08943 |
|---|---|
| Coefficient of variation (CV) | 3.758528256 |
| Kurtosis | 144.9703881 |
| Mean | 20110.02293 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 10.27062743 |
| Sum | 42090278 |
| Variance | 5712954575 |
| Value | Count | Frequency (%) | |
| 0 | 1398 | 66.8% | |
| 50000 | 24 | 1.1% | |
| 35000 | 17 | 0.8% | |
| 10000 | 9 | 0.4% | |
| 100000 | 9 | 0.4% | |
| 15000 | 5 | 0.2% | |
| 20000 | 4 | 0.2% | |
| 30000 | 3 | 0.1% | |
| 34000 | 3 | 0.1% | |
| 5000 | 3 | 0.1% | |
| 40000 | 3 | 0.1% | |
| 25000 | 3 | 0.1% | |
| 9800 | 2 | 0.1% | |
| 14840 | 2 | 0.1% | |
| 49500 | 2 | 0.1% | |
| 34500 | 2 | 0.1% | |
| 85000 | 2 | 0.1% | |
| 4331 | 2 | 0.1% | |
| 49950 | 2 | 0.1% | |
| 49750 | 2 | 0.1% | |
| 75000 | 2 | 0.1% | |
| 49077 | 2 | 0.1% | |
| 45000 | 2 | 0.1% | |
| 36988 | 1 | < 0.1% | |
| 38398 | 1 | < 0.1% | |
| Other values (588) | 588 | 28.1% |
| Value | Count | Frequency (%) | |
| 0 | 1398 | 66.8% | |
| 161 | 1 | < 0.1% | |
| 415 | 1 | < 0.1% | |
| 1360 | 1 | < 0.1% | |
| 1494 | 1 | < 0.1% | |
| 1547 | 1 | < 0.1% | |
| 1580 | 1 | < 0.1% | |
| 1963 | 1 | < 0.1% | |
| 2242 | 1 | < 0.1% | |
| 2415 | 1 | < 0.1% |
| Value | Count | Frequency (%) | |
| 1509550 | 1 | < 0.1% | |
| 1255175 | 1 | < 0.1% | |
| 1058672 | 1 | < 0.1% | |
| 800054 | 1 | < 0.1% | |
| 776318 | 1 | < 0.1% | |
| 642404 | 1 | < 0.1% | |
| 634549 | 1 | < 0.1% | |
| 586026 | 1 | < 0.1% | |
| 573429 | 1 | < 0.1% | |
| 552478 | 1 | < 0.1% |
GrAppv
Real number (ℝ≥0)
| Distinct count | 659 |
|---|---|
| Unique (%) | 31.5% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 233398.54706163402 |
|---|---|
| Minimum | 4500 |
| Maximum | 2350000 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 16.5 KiB |
Quantile statistics
| Minimum | 4500 |
|---|---|
| 5-th percentile | 10000 |
| Q1 | 30000 |
| median | 63000 |
| Q3 | 300000 |
| 95-th percentile | 1000000 |
| Maximum | 2350000 |
| Range | 2345500 |
| Interquartile range (IQR) | 270000 |
Descriptive statistics
| Standard deviation | 343962.8197 |
|---|---|
| Coefficient of variation (CV) | 1.473714486 |
| Kurtosis | 6.06429949 |
| Mean | 233398.5471 |
| Median Absolute Deviation (MAD) | 50500 |
| Skewness | 2.344334224 |
| Sum | 488503159 |
| Variance | 1.183104213e+11 |
| Value | Count | Frequency (%) | |
| 50000 | 251 | 12.0% | |
| 10000 | 138 | 6.6% | |
| 25000 | 135 | 6.5% | |
| 100000 | 99 | 4.7% | |
| 35000 | 91 | 4.3% | |
| 30000 | 79 | 3.8% | |
| 20000 | 58 | 2.8% | |
| 15000 | 56 | 2.7% | |
| 150000 | 40 | 1.9% | |
| 5000 | 39 | 1.9% | |
| 40000 | 31 | 1.5% | |
| 45000 | 24 | 1.1% | |
| 60000 | 20 | 1.0% | |
| 200000 | 18 | 0.9% | |
| 250000 | 17 | 0.8% | |
| 75000 | 16 | 0.8% | |
| 1000000 | 16 | 0.8% | |
| 300000 | 15 | 0.7% | |
| 500000 | 15 | 0.7% | |
| 85000 | 9 | 0.4% | |
| 80000 | 9 | 0.4% | |
| 125000 | 8 | 0.4% | |
| 135000 | 8 | 0.4% | |
| 37000 | 7 | 0.3% | |
| 90000 | 7 | 0.3% | |
| Other values (634) | 887 | 42.4% |
| Value | Count | Frequency (%) | |
| 4500 | 1 | < 0.1% | |
| 5000 | 39 | 1.9% | |
| 5500 | 1 | < 0.1% | |
| 6000 | 5 | 0.2% | |
| 7000 | 1 | < 0.1% | |
| 7500 | 4 | 0.2% | |
| 8000 | 2 | 0.1% | |
| 8600 | 1 | < 0.1% | |
| 10000 | 138 | 6.6% | |
| 10500 | 2 | 0.1% |
| Value | Count | Frequency (%) | |
| 2350000 | 1 | < 0.1% | |
| 2000000 | 6 | 0.3% | |
| 1999000 | 1 | < 0.1% | |
| 1944300 | 1 | < 0.1% | |
| 1899500 | 1 | < 0.1% | |
| 1836000 | 1 | < 0.1% | |
| 1799000 | 1 | < 0.1% | |
| 1682000 | 1 | < 0.1% | |
| 1665000 | 2 | 0.1% | |
| 1656500 | 1 | < 0.1% |
SBA_Appv
Real number (ℝ≥0)
| Distinct count | 754 |
|---|---|
| Unique (%) | 36.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 189470.24175824175 |
|---|---|
| Minimum | 2250 |
| Maximum | 2115000 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 16.5 KiB |
Quantile statistics
| Minimum | 2250 |
|---|---|
| 5-th percentile | 5000 |
| Q1 | 15000 |
| median | 42500 |
| Q3 | 240000 |
| 95-th percentile | 850000 |
| Maximum | 2115000 |
| Range | 2112750 |
| Interquartile range (IQR) | 225000 |
Descriptive statistics
| Standard deviation | 299244.2617 |
|---|---|
| Coefficient of variation (CV) | 1.579373409 |
| Kurtosis | 6.570152426 |
| Mean | 189470.2418 |
| Median Absolute Deviation (MAD) | 36125 |
| Skewness | 2.42137024 |
| Sum | 396561216 |
| Variance | 8.954712815e+10 |
| Value | Count | Frequency (%) | |
| 25000 | 230 | 11.0% | |
| 5000 | 116 | 5.5% | |
| 12500 | 109 | 5.2% | |
| 17500 | 84 | 4.0% | |
| 15000 | 75 | 3.6% | |
| 50000 | 73 | 3.5% | |
| 10000 | 52 | 2.5% | |
| 7500 | 45 | 2.2% | |
| 20000 | 30 | 1.4% | |
| 22500 | 27 | 1.3% | |
| 2500 | 25 | 1.2% | |
| 8500 | 22 | 1.1% | |
| 127500 | 21 | 1.0% | |
| 21250 | 20 | 1.0% | |
| 75000 | 19 | 0.9% | |
| 42500 | 15 | 0.7% | |
| 225000 | 14 | 0.7% | |
| 4250 | 13 | 0.6% | |
| 12750 | 12 | 0.6% | |
| 40000 | 12 | 0.6% | |
| 375000 | 12 | 0.6% | |
| 30000 | 11 | 0.5% | |
| 150000 | 10 | 0.5% | |
| 187500 | 10 | 0.5% | |
| 90000 | 10 | 0.5% | |
| Other values (729) | 1026 | 49.0% |
| Value | Count | Frequency (%) | |
| 2250 | 1 | < 0.1% | |
| 2500 | 25 | 1.2% | |
| 2750 | 1 | < 0.1% | |
| 3000 | 4 | 0.2% | |
| 3500 | 1 | < 0.1% | |
| 4000 | 2 | 0.1% | |
| 4250 | 13 | 0.6% | |
| 4300 | 1 | < 0.1% | |
| 4500 | 1 | < 0.1% | |
| 5000 | 116 | 5.5% |
| Value | Count | Frequency (%) | |
| 2115000 | 1 | < 0.1% | |
| 1999000 | 1 | < 0.1% | |
| 1836000 | 1 | < 0.1% | |
| 1799000 | 1 | < 0.1% | |
| 1682000 | 1 | < 0.1% | |
| 1655000 | 1 | < 0.1% | |
| 1620000 | 1 | < 0.1% | |
| 1583000 | 1 | < 0.1% | |
| 1512000 | 1 | < 0.1% | |
| 1500000 | 5 | 0.2% |
Pearson's r
Pearson's correlation coefficient (r) is a measure of linear correlation between two variables. Its value lies between -1 and +1: -1 indicates total negative linear correlation, 0 indicates no linear correlation, and +1 indicates total positive linear correlation. Furthermore, r is invariant under separate changes in location and scale of the two variables, which implies that for a linear relationship the steepness of the line (its angle to the x-axis) does not affect r. To calculate r for two variables X and Y, one divides the covariance of X and Y by the product of their standard deviations.
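The calculation described above can be sketched in plain Python. The function name `pearson_r` and the sample data are illustrative, and this is not the code the profiling tool runs internally.

```python
import math

def pearson_r(x, y):
    """Covariance of x and y divided by the product of their standard deviations."""
    n = len(x)
    mean_x = sum(x) / n
    mean_y = sum(y) / n
    # The 1/n (or 1/(n-1)) factors cancel between numerator and denominator.
    cov = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    var_x = sum((a - mean_x) ** 2 for a in x)
    var_y = sum((b - mean_y) ** 2 for b in y)
    return cov / math.sqrt(var_x * var_y)

x = [1, 2, 3, 4, 5]
y = [2, 4, 6, 8, 10]                    # exact linear function of x
shifted = [10 * a + 3 for a in x]       # change of location and (positive) scale

print(pearson_r(x, y))
print(pearson_r(shifted, y))            # unchanged by the shift and rescale
print(pearson_r(x, [-b for b in y]))    # a negated variable flips the sign
```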
Spearman's ρ
Spearman's rank correlation coefficient (ρ) is a measure of monotonic correlation between two variables, and is therefore better at catching nonlinear monotonic correlations than Pearson's r. Its value lies between -1 and +1: -1 indicates total negative monotonic correlation, 0 indicates no monotonic correlation, and +1 indicates total positive monotonic correlation. To calculate ρ for two variables X and Y, one divides the covariance of the rank variables of X and Y by the product of their standard deviations.
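As a rough sketch of that definition, the snippet below ranks both variables (ties receive the average of their positions, which is an assumption about tie handling) and then applies the covariance-over-standard-deviations formula to the ranks. The helper names `average_ranks` and `spearman_rho` are illustrative only.

```python
import math


def average_ranks(values):
    """1-based ranks; tied values share the average of their positions."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    ranks = [0.0] * len(values)
    i = 0
    while i < len(order):
        j = i
        # Extend j over the run of values tied with position i.
        while j + 1 < len(order) and values[order[j + 1]] == values[order[i]]:
            j += 1
        avg = (i + j) / 2 + 1
        for k in range(i, j + 1):
            ranks[order[k]] = avg
        i = j + 1
    return ranks


def spearman_rho(x, y):
    """Covariance of the rank variables over the product of their standard deviations."""
    rx, ry = average_ranks(x), average_ranks(y)
    n = len(rx)
    mx, my = sum(rx) / n, sum(ry) / n
    cov = sum((a - mx) * (b - my) for a, b in zip(rx, ry))
    sx = math.sqrt(sum((a - mx) ** 2 for a in rx))
    sy = math.sqrt(sum((b - my) ** 2 for b in ry))
    return cov / (sx * sy)


# A nonlinear but strictly increasing relationship: the ranks agree perfectly.
x = [1, 2, 3, 4, 5]
y = [v ** 3 for v in x]
print(spearman_rho(x, y))
```

Because only the ordering of the values matters, any strictly increasing transformation of either variable leaves ρ unchanged.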
Kendall's τ
Similarly to Spearman's rank correlation coefficient, the Kendall rank correlation coefficient (τ) measures ordinal association between two variables. Its value lies between -1 and +1: -1 indicates total negative correlation, 0 indicates no correlation, and +1 indicates total positive correlation. To calculate τ for two variables X and Y, one determines the number of concordant and discordant pairs of observations. τ is given by the number of concordant pairs minus the number of discordant pairs, divided by the total number of pairs.
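The pair-counting formula above corresponds to the variant usually called tau-a. The sketch below implements exactly that formula under the assumption that tied pairs count as neither concordant nor discordant; library implementations often report the tie-adjusted tau-b instead, so values may differ when the data contain ties. The function name `kendall_tau_a` is hypothetical.

```python
from itertools import combinations


def kendall_tau_a(x, y):
    """(concordant pairs - discordant pairs) / total number of pairs."""
    n = len(x)
    if n != len(y) or n < 2:
        raise ValueError("x and y must have the same length (at least 2)")
    concordant = 0
    discordant = 0
    for (x1, y1), (x2, y2) in combinations(zip(x, y), 2):
        sign = (x1 - x2) * (y1 - y2)
        if sign > 0:
            concordant += 1  # both variables move in the same direction
        elif sign < 0:
            discordant += 1  # the variables move in opposite directions
        # sign == 0 is a tie in x or y and is counted in neither group
    total_pairs = n * (n - 1) // 2
    return (concordant - discordant) / total_pairs


# Two concordant pairs and one discordant pair out of three.
print(kendall_tau_a([1, 2, 3], [1, 3, 2]))
```

This direct version compares every pair and is therefore quadratic in the number of observations, which is fine for illustration.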
Phik (φk)
Phik (φk) is a new and practical correlation coefficient that works consistently between categorical, ordinal and interval variables, captures non-linear dependency, and reverts to the Pearson correlation coefficient in the case of a bivariate normal input distribution. Extensive documentation is available for Phik.
Cramér's V (φc)
Cramér's V is an association measure for nominal random variables. The coefficient ranges from 0 to 1, with 0 indicating independence and 1 indicating perfect association. The empirical estimators used for Cramér's V have been shown to be biased, even for large samples. We use a bias-corrected measure proposed by Bergsma in 2013.
First rows
| | Zip | ICS | ApprovalFY | Term | NoEmp | NewExist | CreateJob | RetainedJob | FranchiseCode | UrbanRural | RevLineCr | LowDoc | DisbursementDate | DisbursementGross | MIS_Status | ChgOffPrinGr | GrAppv | SBA_Appv |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 0 | 92801 | 532420 | 2001 | 36 | 1 | 1.0 | 0 | 0 | 1 | 0 | 1 | 1 | 2001 | 32812 | 0 | 0 | 30000 | 15000 |
| 1 | 90505 | 531210 | 2001 | 56 | 1 | 1.0 | 0 | 0 | 1 | 0 | 1 | 1 | 2003 | 30000 | 0 | 0 | 30000 | 15000 |
| 2 | 92103 | 531210 | 2001 | 36 | 10 | 1.0 | 0 | 0 | 1 | 0 | 1 | 1 | 2001 | 30000 | 0 | 0 | 30000 | 15000 |
| 3 | 92108 | 531312 | 2003 | 36 | 6 | 1.0 | 0 | 0 | 1 | 0 | 1 | 1 | 2003 | 50000 | 0 | 0 | 50000 | 25000 |
| 4 | 91345 | 531390 | 2006 | 240 | 65 | 1.0 | 3 | 65 | 1 | 1 | 0 | 1 | 2006 | 343000 | 0 | 0 | 343000 | 343000 |
| 5 | 95831 | 531210 | 2003 | 84 | 1 | 1.0 | 0 | 0 | 1 | 0 | 1 | 1 | 2003 | 55825 | 0 | 0 | 50000 | 25000 |
| 6 | 90255 | 531210 | 2006 | 269 | 2 | 1.0 | 0 | 2 | 1 | 1 | 0 | 1 | 2006 | 297500 | 1 | 247074 | 297500 | 223125 |
| 7 | 90808 | 531210 | 2006 | 84 | 1 | 2.0 | 2 | 1 | 1 | 2 | 1 | 1 | 2006 | 67047 | 0 | 0 | 30000 | 15000 |
| 8 | 92704 | 531390 | 2004 | 22 | 5 | 1.0 | 0 | 0 | 1 | 1 | 2 | 1 | 2004 | 50000 | 1 | 35333 | 50000 | 25000 |
| 9 | 94583 | 531320 | 2004 | 84 | 4 | 2.0 | 0 | 0 | 1 | 1 | 1 | 1 | 2004 | 17500 | 0 | 0 | 10000 | 5000 |
Last rows
| | Zip | ICS | ApprovalFY | Term | NoEmp | NewExist | CreateJob | RetainedJob | FranchiseCode | UrbanRural | RevLineCr | LowDoc | DisbursementDate | DisbursementGross | MIS_Status | ChgOffPrinGr | GrAppv | SBA_Appv |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 2083 | 92021 | 531390 | 2006 | 32 | 2 | 2.0 | 0 | 2 | 1 | 2 | 1 | 1 | 2006 | 76353 | 1 | 23594 | 30000 | 15000 |
| 2084 | 90640 | 531210 | 2006 | 54 | 2 | 2.0 | 0 | 0 | 1 | 1 | 1 | 1 | 2006 | 12950 | 1 | 9994 | 10000 | 5000 |
| 2085 | 92618 | 531390 | 2006 | 84 | 3 | 1.0 | 0 | 3 | 1 | 1 | 1 | 1 | 2006 | 105766 | 0 | 0 | 100000 | 50000 |
| 2086 | 91902 | 531210 | 2006 | 84 | 3 | 1.0 | 0 | 3 | 1 | 2 | 1 | 1 | 2006 | 92502 | 0 | 0 | 30000 | 15000 |
| 2087 | 95112 | 531210 | 2006 | 240 | 6 | 1.0 | 4 | 6 | 1 | 1 | 0 | 1 | 2007 | 721000 | 0 | 0 | 721000 | 721000 |
| 2088 | 91331 | 532310 | 2006 | 240 | 28 | 1.0 | 8 | 28 | 1 | 1 | 0 | 1 | 2006 | 1029000 | 0 | 0 | 1029000 | 1029000 |
| 2089 | 92346 | 532230 | 2006 | 60 | 5 | 2.0 | 0 | 5 | 1 | 1 | 0 | 1 | 2006 | 150000 | 0 | 0 | 150000 | 75000 |
| 2090 | 92021 | 532120 | 1997 | 300 | 4 | 1.0 | 0 | 0 | 1 | 0 | 0 | 1 | 1997 | 99000 | 0 | 0 | 99000 | 79200 |
| 2091 | 93012 | 532120 | 1997 | 84 | 2 | 1.0 | 0 | 0 | 1 | 0 | 0 | 1 | 1997 | 50000 | 0 | 0 | 50000 | 40000 |
| 2092 | 91352 | 532120 | 1997 | 120 | 3 | 1.0 | 0 | 0 | 1 | 0 | 0 | 1 | 1997 | 251150 | 0 | 0 | 500000 | 375000 |